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Abstract 

We consider portfolio selection when decisions based on a dynamic 
risk measure are affected by the use of a moving horizon, and the possible 
inconsistencies that this creates. By giving a formal treatment of time 
consistency which is independent of Bellman's equations, we show that 
there is a new sense in which these decisions can be seen as consistent. 



1 Introduction 

Risk is an active area of study. The management of uncertain outcomes, and 
decision making in this context, is of considerable importance. Much recent 
research has focussed around properties of 'coherent risk measures', as first 
discussed in [Ij, and 'convex risk measures', as defined by [12] and [13]. These 
are functionals p : L^{J-t) — > K, where T is some future time, and L'^{J-t) is 
the space of integrable J-y-measurable random variables. In the convex case, it 
is assumed that these functionals satisfy three assumptions, namely: 

1. Monotonicity: X > y,P-a.s. => p{X) < p{Y), 

2. Translation invariance: p{X + c) — p{X) — c for all c G M, 

3. Convexity: p{XX + (1 - X)Y) < Xp{X) + (1 - A)p(r) for all A e [0, 1]. 

One significant flaw with these risk measures is that they are essentially static 
- they consider only one random outcome, and do not model the development 
of information through time. Simply applying these risk measures to a multiple- 
period problem is insufficient, as there is no guarantee that they will lead to time- 
consistent decision making. In particular, there is no guarantee that Bellman's 
principle will be satisifed. Concrete examples of this can be found in [17] and 
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More recently, Artzner et al. [5] discussed how a particular expression of 
Bellman's principle is equivalent to a recursivity property of the risk measures, 
namely if Pt{X) denotes the risk of X as considered at time then for any s < t, 
we have Ps{X) = ps{~pt{X)). In jTS], an equivalent property, (given translation 
invariance), is considered. Specifically, in a type of inter-temporal mono- 
tonicity is assumed, that is, for any times s < t, pt{X) > Pt{Y) P-a.s. implies 
PsiX) > Ps{Y) P-a.s. In this paper, we show that a version of this monotonicity 
condition is equivalent to a general form of Bellman's principle; see Theorem 

Much has been written on dynamic risk measures, that is, risk measures 
where a recursivity property is satisfied. See, for example, [25], [3], [10]. Sim- 
ilarly, a theory of 'time-consistent nonlinear expectations' has been developed. 
See particularly [52] and the references therein. These satisfy assumptions very 
similar to those of dynamic risk measures, the main difference being a sign 
change in each of the three assumptions above. To construct these function- 
al, a common tool is the theory of Backward Stochastic Differential Equa- 
tions (BSDEs), and it is known that all nonlinear expectations, (satisfying some 
constraints), can be expressed as solutions of BSDEs, (see ^ and [T5] in the 
continuous time case, [5] and [7] in the discrete time case). 

To apply these methods, one must typically fix a distant point T in the 
future, (possibly infinitely distant), at which all payoffs will be realised. Alter- 
natively, as for example in [53] or [3], one can generalise the risk measures to 
operators pa,T ■ L°°{^a) L°°{^t) where a < t are stopping times. If we 
assume that t <T for some fixed T, we can then replace r with T throughout, 
by the property pa^riX) — pa,T{X) for all X £ i°°(7>). 

In many investment applications, predicting even the distribution of ex- 
tremely long-term behaviour is almost impossible. One might hope to use a 
shorter-dated moving horizon, where the portfolio value at some fixed time into 
the future, (say, one-year from the present), is considered, but this horizon is 
allowed to move forward as time progresses. That is, the risk is calculated based 
on the portfolio value a short time in the future, rather than at the terminal time 
T. Hence, if Vt is the portfolio value at time t, our risk at time t is measured 
by ptiVt+m), where m is the horizon distance. As Vs+m 7^ Vt+in in general, it 
is clear that the recursivity properties imply no relationship between ps{Vs+m) 
and ps{—ptiVt+rn)), and an approach to time-consistency based on the recur- 
sivity property of p is insufficient. In this paper, we discuss what consistency 
properties remain under such a regime. 

These and similar questions have also led to the forward-performance ap- 
proach of Musiela and Zariphopolou and others (for example, [IH], [20], [57]V 
Here a self-generation property is used to ensure that, under optimal behaviour, 
Ps{Vt) is independent of t for t > and so no problem arises. However, this 
requires that p is of a very special type, and depends on the model used of the 
market. We here consider the consequences of simply assuming that p satisfies 
the standard recursivity property. 

This work is also motivated by applications in economic regulation. In many 
risk management settings the 'risk' is calculated over some finite horizon, to 
ensure it does not exceed certain bounds. For example, in the Basel II Banking 
accords, regulators calculate a ten-day 99%-value-at-risk for market risk, and 
a one-year 99.9%-value-at-risk for credit and operational risks. See [TB] for 
more details. Even if these risks are calculated using a dynamic risk measure. 
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(which, as is well known, value- at-risk is not), the moving horizon will introduce 
inconsistencies into the analysis. 

Time inconsistent problems have been classically studied in economics, for 
example, in the works of [H], [5T], [53] and [55] or more recently in [TT] and 
[5]. The approach used in these papers is based on solving an intertemporal 
game. In particular, the 'optimal' strategy is selected subject to the requirement 
that there will be no benefit from deviating from it at any point in the future. 
Clearly, determining such strategies generally requires explicit consideration of 
future behaviour. This approach to choosing strategies is not of key interest for 
the moving horizon problem. The reason for this is that if one were to consider 
the actions one will take tomorrow, one would have to consider behaviour up 
to tomorrow's horizon, that is, one day further than the horizon considered 
today. In this case, one may as well consider this more distant horizon directly. 
A recursive argument then shows that this would result in the horizon being 
extended into the distant future, and therefore, the moving horizon problem 
would essentially disappear. 

For this reason, we consider the situation where decisions are made in a 
completely naive manner, without regard for future behaviour. Our question 
is whether this approach will yield time-consistent policies, which will clearly 
depend on those policies available and the values assigned to them. We shall 
show that, for a simple dynamic investment problem, decision making with a 
moving horizon is not time-consistent in general. We shall then show that there 
exists a modified version of time-consistency which is satisfied, given certain 
assumptions on the possible policy space. 

For simplicity, we shall work in a discrete time setting. The continuous 
time setting is conceptually similar, and we expect that many of the results 
obtained will carry over, with appropriate technical modification. However, 
there are significant difficulties in working with moving horizons in continuous 
time, some of which are explored in [B]. 

We proceed by first considering the fundamental notions of time-consistency, 
and derive an appropriate variant for our problem. In Section|2]we then formally 
introduce the particular problem under consideration, and show that the classi- 
cal requirements of time-consistency are not satisfied; however, the requirements 
for our modified concept are. 

2 Time-consistency and policies 

We now present a general definition of time-consistency, which is essentially 
a formalisation of Bellman's Principle of Optimality. While taking Bellman's 
Principle as a useful basis for a definition of time consistency, we shall not 
assume that the value function is the solution of Bellman's equation and, hence, 
the problems considered may not be time-consistent. 

In general, we assume that there is a set of allowable policies which are 
adapted processes taking values in some metric space U. They are selected to 
optimise some value function V, which is in general a family of maps 

Vt-M^L\Tt). t e {o,i,...r}. 

For simplicity, we take higher values of V as better than lower. 
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Definition 2.1. Let X gU be a policy. We define the conditional policy space 

at t given past policy X , 

U\f = {X' &U:X'^ = Xs P - U.S. for all s < t]. (1) 

Note that = U for all X . 

Let {X*} be a collection containing a policy choice X* G U for each time 
t <T. Let X denote that policy which is eventually chosen, that is Xu '■= X^. 
Then this policy choice is viable if, for every s <t, 

X* G U\f 

or equivalently 

x*€u\^=f]u\r=u\r\ 

s<t 

To ensure that in different states of the world different decisions can be 
independently made, we have the following property. 

Definition 2.2. We say the conditional policy space satisfies the pasting prop- 
erty if for any past policy X gU, 

IaX + IacX' & U\f for all X,X' & U\f ,A& Tt. (2) 

We say the value function V satisfies the zero-one law if 

Vt{lAX + Ia^X') = UVtiX) + lA^Vt(X') (3) 

for all X,X' €U\f, AeJ'f 

Intuitively, wc think of X* as the policy which an investor intends to pursue, 
when making a selection at time t. A collection being viable ensures that X, the 
policy that is finally chosen, does not involve an investor attempting to change 
their past actions at any time. 

Note that this definition requires that the past policy is matched both in the 
observed past and in all possible other pasts (that is, for all uj). This requirement 
is needed to ensure that switching, at time t, from one policy X to another in 
h{\^ results in a policy which is mU. 

The following result ensures that X is in fact a policy, that is, X gU. 

Lemma 2.1. If {X*} is a viable policy choice, then X^ = X. Hence X gU. 

We now give simple conditions under which our problem has a solution. 

Our main focus is not on deriving conditions for the solution to exist, but on 
exploring the implications of the solution for time-consistency; therefore, the 
restrictive nature of these conditions is not a major concern. It is easy to see 
that our main results all have appropriate modifications to more general settings 

whenever the existence of optimal policies is given. 

Definition 2.3. In general, we shall say that our problem is standard if 

• hi is a compact subset of adapted processes on U, with induced metric 
d{X,X') = J2s E![di!{Xs, Xg)]. For any X,X' G U, this metric satisfies 

d{X, X') =0 if and only if X, = X'^ P - a.s. for all s. 
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Note that we do not assume that ¥-almost sure convergence is metrizable. 
A simple example is a compact subset of adapted processes taking values 
Xt e Li(K^; j;) for allt. 

• For all t, the value function Vt '■ U ~^ L^{J-t) is lower semicontinuous 
under the metric topology, that is, if X" X°° and Vt{X") < Vt{X"+^) 
P-a.s. for all n, then lini„^oo Vt{X") = Vt{X°°). 

• For all t, lA\f as defined by (QJ) satisfies the pasting property (0) and Vt 
satisfies the zero-one law 

Lemma 2.2. If U is compact, then U\f is compact for all times t and all 
policies X . 

Proof. For any sequence {-'^"} in U\f , we know that X^ — Xu for all u <t. As 
lA is compact, there exists a convergent subsequence of X". This subsequence 
has a modification in lA\f by the pasting property, and as we are in discrete 
time these modifications are indistinguishable. Therefore, lA\f^ is sequentially 
compact and hence compact. □ 

Theorem 2.1. For a standard problem, for all t and any past policy X , there 
exists a policy X* S ^It*" such that for any X €U\f 

Vt{X')>Vt{X), a.s. 

We shall say that the policy X* given by Theorem \2.1\ uniformly maximises 
Vt onU\^. 

Proof. This is classical result from the assumptions of a standard problem. □ 

Definition 2.4. Let {X*} be a viable collection containing a policy choice for 
each time t. This generates a realised policy X , defined by Xt — Xf — Xj for 
all t. This collection is called optimal if 

(i) for any t, the policy X^ uniformly maximises Vt{X) for X £ , 

and time consistent if 

(a) for any time t, we have 

Vt(X*) = VtiX), F-a.s. 

Remark 2.1. Unlike most interpretations of Bellman's principle, this definition 
is 'forward looking', and does not, in general, admit the use of dynamic pro- 
gramming as a means of finding optimal policies. 

Definition 12.41 directly allows initial behaviour to affect future behaviour in 
complex ways. This idea is embedded in the assumption of viability and the 
freedom to specify which policies are in U. That is, this approach allows pre- 
commitment contracts and similar devices to be modelled, through restrictions 
on 14. The definition, therefore, looks for consistent behaviour contingent on 
what has already been done. 
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Essentially, if we choose an optimal policy today, we simply need to check 
that, in the future, we shall continue to follow a policy which we consider equiv- 
alent to the optimal choice today. In some sense, a policy is time consistent if 
it leads to a ^commitment to previous decisions\ 

This definition has the distinct disadvantage of not requiring us to ensure 
that our decisions today will make us happy in the future. The policies selected 
as optimal in the future only need to lie in , that is, in the space of policies 
we have left ourselves to choose from. A simple example of this is when the 
space U consists only of 'buy-and-hold' policies. Here we make a decision at 
time zero, and are unable to modify it at any point in the future - the policy 

chosen at time zero is the only policy vi\U\f = for alH > 0. Hence this 
decision is time consistent, as no deviation from the initial plan is permitted. 

Remark 2.2. It is important to note that, if {X*} is optimal, for any i, as X* 
maximises Vt and 

Property (ii) of Definition 12.41 can only ever fail through a future decision ap- 
pearing sub-optimal today, that is, it is always true that 

Vt{X') > VtiX), P-a.s. 

2.1 Bellman's principle and time-consistency 

We now give a relation between time consistency in the sense of Definition 12.41 
and a type of intertemporal monotonicity for the value function. For simplicity, 
we write [s,t[ for the discrete collection of times {s,s-|-l, 1}, and similarly 

for ]s, t]. 

This result is closely related to results of Artzner et al. f?. Our approach 
differs from theirs mainly in the attention given to the space of possible policies 
U. 

Theorem 2.2. The following statements are equivalent 

(i) The value V is such that every optimal policy choice is also time- consistent, 
for every initial compact policy set hi of adapted processes in U. 

(ii) For any adapted processes X, X' taking values in U and any times s < t, if 
Xu = X'^ for all u G [0,t[ and Vt[X) > Vt(X') a.s. then Vs{X) > Vs{X') 
a.s. 

Proof. We interpret all (in-)equalities as P-a.s. 

(i implies ii.) Assume our policy space is given hy U — {IaX + Ia^X' : A G 
Tt}. Note that as Xu = X'^ for all u € [0,i[, we have W/^ = U^^ = Uf. Then 
at time i, if Vt{X) > Vt{X') we will find X* = X is an optimal policy. This 
implies that X — X, a.s Xu ~ for u E [0,t[. Hence by time consistency, 

V.(X') < VsiX') = Vs{X) - Vs{X). 

(ii implies i.) Let s be the first time that Vt(X*) = Vt{X) for all t > s. By 
Lemma [Q. s < T. As {X*} is optimal and X'^+i e Ulf" , we know 

V.+i(X^) < V,+i(X^+i) = V.+i(X). 
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By (ii), this implies that Vs{X^) < Vs{X). As is optimal, it follows that 
V,{X^)=VAX). 

Therefore, if Vt(X*) = Vt{X) for all t > s, then Vs(X'*) = VsiX). By 
induction, this must hold for all times, that is, the optimal choice is consistent. 

□ 

Corollary 2.2.1. The value function given by Bellman's equation is time con- 
sistent for any initial policy set U. 

Proof. Let /(w, s, Xg) be the payoff at time s of following policy Xg- Bellman's 
equation then gives, for a fixed policy X, the value function 

VsiX) ^ E[f{Lu,s,Xs) + Vs+i{X)\Tsl 

By recursion, given past policy X, this clearly implies that Vs{X) is a functional 
only of {Xu}uG[s,t[ and Vt{X), and hence, statement (ii) of Theorem 12.21 is 
satisfied. □ 

Corollary 2.2.2. The value functions given by dynamic risk measures and 
nonlinear expectations are time consistent for any initial policy setlA. 

Proof. In this context, the policy X determines a (stochastic) terminal value 
Vr^ . Our nonlinear expectation/dynamic risk measure the yields the value 

By the recursivity and monotonicity properties of nonlinear expectations/dynamic 
risk measures, we can write Vs{X) as a nondecreasing functional of the future 
values Vt{X). Hence statement (ii) of Theorem 12.21 is satisfied. □ 

2.2 Dependable decisions 

We now propose a new type of 'time consistency', which we call 'dependability'. 
One can characterise classical time-consistency through the statement 'a policy 
X is time consistent if the policies chosen in the future, pasted together with 
X, give the same value today as X does.' 

Our new definition would then read, 'a policy X is dependable if the policies 
chosen in the future, pasted together with X, give higher values today than 
policy X does'. In some sense, dependable policies are those that form a lower 
bound on the value function, irrespective of future decisions. 

Intuitively, we suppose that, at any given time, a decision maker can only 
consider a subset of all possible plans, and will select the optimal policy from 
this subset. As time progresses, more plans can be considered, and so preferable 
alternatives may arise. 'Dependability' is then a notion of time-consistency 
which allows for these new alternatives. 

Definition 2.5. Consider a standard problem. Suppose that, for each time 
t > 0, we only consider policies restricted to some compact subset IA\^ C . 
Assume U\f satisfies the pasting property 

Let {X*} be a viable collection of policies AT* e for s < t, such that 
A* e for each t. Note in general A* ^ U\f for s <t. 

This collection is called Z// -optimal if 
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(i) for any t, the policy X* uniformly maximises Vt{X) for X g , 
and dependable if 
(ii) for any time t, we have 

Vt{X^) < VtiX), F-a.s. 

Remark 2.3. As highlighted by Remark 12.21 when U\f- =U\f , this will degen- 
erate into the usual time-consistency properties. Here, on the other hand, our 
restricted set of policies , over which we optimise at each time point, can 
make our problem time- inconsistent. 

Remark 2.4. As U\f is compact and satisfies the pasting property the 
other 'standard' properties of V show the existence of a policy X* uniformly 
maximising Vt onU\^ . 

Note that as we have now restricted the set of policies which we can consider 
at any time point, the result of Theorem 12.21 no longer applies. 

Under this definition, it is perfectly reasonable that a nai've policy may be 
selected early on. However, when it is reconsidered later, this decision might 
be changed. The difference is that this decision is 'dependable' if, had we been 
allowed to initially consider the decision with the later change, we would have 
preferred it to the policy initially chosen. 

This 'dependable' approach to time-consistency is a natural one for problems 
where only a subset of possible policies can be considered at each time. We 
shall see that the problems induced by the moving-horizon approach to risk 
measurement are of this type. 

3 An investment policy model 

We now move to the specific problem of consistency of decisions based on a 
moving horizon. 

Consider a probability space based on a classical model of a financial market 
in discrete time. We assume that all positions will be closed out at or before 
some distant deterministic time T. Hence, time can be indexed by the set 

{o,i,...,r}. 

We suppose that there are d risky assets {S**} defined on some complete 
filtered probability space (fi, J", {J^i}, P). We assume that St G L'^{J^t) for 
all i, where S denotes the vector of risky asset prices. We also assume the 
existence of a 'risk-free' asset, however, for simplicity, we shall assume that the 
risk free interest rate is zero. Equivalently, we assume all quantities have been 
appropriately discounted. We assume that there are no transaction costs. 

A firm wishing to invest in this market has a range of self-financing policies 
available, which is a subset U of the adapted processes in = U. We assume 
that W is a compact subset of adapted processes {X : Xt G i^(R''; J-t)}- 

An investor's wealth process V''^ satisfies the stochastic difference equation 

V,l,^{XuSt+i-St) + V/'. (4) 

(The risk-free asset could also be included, but as we assume the risk-free interest 
rate is zero, it would not affect the dynamics of V'^ .) For notational simplicity. 
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we extend V beyond time T by setting — for all u > T. Note that a 
policy Xt describes the choice to be made under every contingency, and is not 
required to be Markovian or of feedback form. 

We now state the following general definition, due to Peng (eg [35], [23] )• 

Definition 3.1 (Nonlinear Expectations). A system of operators 

£{-\Tt) : L\Ft) ^ L\Tt) 

is called an !Ft- consistent nonlinear expectation if it satisfies the following prop- 
erties. 

1. (Monotomcity) IfQ > Q' P-a.s. then£{Q\Tt) > EiQ'lJ't), with£{Q\Tt) = 
£{Q'\Ft) only if Q ^ Q' V-a.s. 

2. (Constant invariance) For Q G L^{J-t), £(Q\Tt) = Q- 

3. (Recursivity) For any s < t, £{£{Q\Tt)\Fs) = £{Q\Ts) P-a.s. 
^. (Zero-one law) For any A G J-t, lA£{Q\^t) — £{lAQ\J't)- 

Remark 3.f. In [8 and [7], we have given a representation result for these 
operators in discrete time on finite horizons using the theory of BSDEs. These 
results give a complete description of nonlinear expectations (and the more 
general class of nonlinear evaluations) in this context. These results are not 
germane to the present work so we shall simply proceed by assuming that a 
nonlinear expectation is given. 

At each time t, an investor wishes to choose the 'time-i-optimaP policy 
X* G U\f . We shall model their decision as based on a value function given 
by a dynamic risk measure pt{-)^ or equivalently, by an J^f -consistent nonlinear 
expectation £{-\J^t)- That is we have the following problem: 

Definition 3.2. The simple moving horizon problem with horizon m is to find 
a viable policy choice {X*}, where for each t, X* uniformly maximises 

Vt{X) ^ £{V,lJTt) = -pt{V{i„,) 

for X &U\f . 

We emphasise at this point that we have chosen our value function such that 
the time-inconsistency in this problem arises purely because of the short horizon. 
The nonlinear expectation itself is time-consistent, in the sense of 2 . However, 
the nonlinear expectation is not being evaluated on the terminal values, which, 
as we shall see, leads to inconsistencies. 

3.1 An equivalence for policies 

We now show that this problem is, in general, equivalent to a dependable prob- 
lem. 

Definition 3.3. We define 

For a given horizon m, we say that lA is closed under truncation if, for all times 
t, all past policies X, 
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The following Lemma is trivial to prove, however forms the basis for the 
desired equivalence. 

Lemma 3.1. At any time t, for any policy X € /[g t+„,[Z//|f^ we have the identity 

VtiX) = £{VtlJTt) = £{V^\Tt) V-a.s. 

Furthermore, for any policy X G U, any time t, 

Vt{X) = Vt{Iio,t+n4X) P-a.s. 

Definition 3.4. The modified moving horizon problem with horizon m is to 
find a viable policy choice {X*}, where for each t, X* uniformly maximises 

for X e I[o.t+rnim? ^--i^^- 

Remark 3.2. We can now consider the 'moving horizon problem' in two distinct 
ways. Either 

• We take the value function Vt{X) = £(V('^„JJ^t), in which case we have a 
time-inconsistent problem, or 

• we take the value function Vt{X) — £{y^\J-t), and then require that our 
selection X* must lie in the set I\^Q^t+m\f^\t' ^Sich. t. 

By Lemma 13.11 we can assume, without loss of generality, that the policy X 
which maximises Vt will lie in this set, and for all such policies we have Vt{X) — 
Vt{X). 

That is, we can consider the moving horizon in terms of a restriction on the 
policy space, rather than in terms of evaluating the wealth process V at the 
moving horizon. The values associated with each policy under these alternative 
approaches will be identical. 

We can now give the following positive result for the moving horizon problem. 

Theorem 3.1. Any U-optimal solution to the modified moving horizon problem 
is dependable. 

By Lemma \3.1l when lA is closed under truncation, this will give the same 
values and policy choices at all times as when using a moving horizon. 

Proof. For each t, we choose X* to maximise Vt{X^) ~ S{V,^ \J^t), for X* e 
. We also know that, for any X e U\f_^^ = ^|^\, we have Vt+i{X) < 
Vt+i(X*+^). Hence, by the monotonicity of nonlinear expectations, we know 
that 

Vt{X) < Vt(X*+i) for all X G U\f+i. 
Specifically, this implies 

VtiX') < Vt(X*+i). 

Similarly, it follows that 

Vt-i{X'-') < Vt-iiX') < Vt-iiX'+'). 
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where the last inequaUty is again by monotonicity of nonlinear expectations. 
By induction, this argument shows that for any times s < t, 



VsiX') < V,(X*). 
Hence, for all s < T, as by Lemma I^TTI X = X'^ , we have the result 

VsiX')<V,iX). 

□ 

Remark 3.3. Note that the requirement on U is that, in some sense, it does 
not enforce commitment, specifically that one can always choose to 'quit at the 
horizon', that is, to take the truncated policy /[q 

4 A dependable but inconsistent example 

To demonstrate the usefulness of these results, we give a simple, if contrived, 
example of a situation where the moving horizon approach is inconsistent, but 
the equivalent approach using a modified policy space is dependable. 

Suppose our market contains only one asset S. The policy space U consists 
of those processes X of the form X^ = Iu<a where tr is a stopping time. 

Let T = 3, and suppose that values are given by the nonlinear expectation 

£(Q|J-0--101ogi?[e-«/i"|J-t]. 

This is evaluated on a horizon two periods from the present, that is, m — 2, and 

VtiX)^£ {Vtl,^\Tt). 

Let S follow a non-recombining binomial tree, with independent increments 
given by 

So = 20 
5*1 — 5*0 



Si = 



1 


w.p. 


0.5 


-0.1 


w.p. 


0.5 


0.1 


w.p. 


0.5 


-10 


w.p. 


0.5 


100 


w.p. 


0.5 


-0.1 


w.p. 


0.5 



Sn — 5*2 — 



Here w.p. denotes 'with probability'. 

It is then easy to see that, in every state of the world uj, the policy chosen 
at each time will be: 



1 t = 
t>0 



Xl = X'^ = a.s. for all t 
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and therefore = 1 a.s. for all t. Comparing these at time 0, we have 

VoiX°) = 0.1889 > -2.4926 = Vo{X), 

and so our optimal solution is not time-consistent. 

On the other hand, at any time t, the permitted polices allow the choice 
Xu = for u > t. That is, U is closed under truncation, in the sense of Theorem 
13.11 Hence we know that this decision is dependable, under an equivalent value 
function. To show this empirically, we define 

VtiX)^£ {V^\Tt) 
and instead consider, at each time, policies in the restricted set 

I[a.t+mii^\t' — ■ 

On this set, from Lemma [XTl we know Vt{X) — Vt{X), and that a policy which 
uniformly maximises Vt will lie in this set. We obtain exactly the same optimal 
policies, but have the values 

Vo(^") = 0.1889 < 0.4741 = Vo(^), 

and so see that (given X^ — X^ = X^ = X) our choice is dependable. Note 
that Vo{X°) ^ Vo(^°), as expected. 

5 Acceptable decisions 

Often the question of interest is whether a policy X is acceptable, that is, has a 
value above a critical level. (Equivalently, has a risk below a critical level.) For 
simplicity, we consider this decision at time t — 0. Suppose that this critical 
value is given by Vo(0), the value associated with the 'null' policy X = 0. Our 
concern that this value function is time-inconsistent, hence we could plan, at 
time t — 0, to follow policy X, but not follow through with it in the future. In 
this event, we require a guarantee that the truncated policy which we eventually 
follow, X, yields an acceptable value today. 

Now suppose that, for a policy X under consideration, we define the space 
of available policies 

U = {I[q^t-[X\t a stopping time}. 

That is, we suppose that could change from following policy X to the policy 
at any stopping time r. This policy space is clearly closed under truncation and 
is compact. It follows that the optimal policy choice in using the modified 
moving horizon value V, is dependable. 

Therefore, if X is acceptable, that is, Vq{X) = Vo(^) > 0, then we can be 
sure that the optimal realised policy X satisfies 

Vo(^) > Vo(^°) > Vo(^) > 

where X'^ is the time-zero optimal policy in U. For this reason, when it is 
possible for a position to be 'sold off' at any time, we can be confident that 
future actions will not act to decrease the value/increase the risk assigned to a 
policy today, at least under an equivalent value function V. 
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6 Conclusions 



We have discussed the theory of time-consistency, and have given a definition 
for a new type of property, that of 'dependabihty'. We have shown that, for a 
simple model of a financial market, under some assumptions on the allowable 
policies, the optimal decision reached using a moving horizon approach is equal 
to an optimal dependable decision using an equivalent value function. 

This result gives a partial justification for using a moving horizon approach 
in risk management. Assume that one can always decide to stop investing at the 
horizon, (that is, to take the policy I[o^t+m[^)- Then one can be sure that the 
optimal policy today, considering only a finite horizon, will only be improved 
by future decisions. 

This analysis still assumes that the underlying value function used is re- 
cursive up to the horizon, in particular, that it is an J^i-consistent nonlinear 
expectation. This could be weakened to assuming that it is simply a nonlinear 
evaluation, and with appropriate adaptation of the arguments involved, we can 
also remove the assumption that interest rates are zero or deterministic. How- 
ever, if the value function used is not recursive, for example, as with Coherent 
Value at Risk, these results would not apply. Essentially this is because these 
value functions introduce different types of time inconsistency, apart from the 
issues of moving horizons. 

Given the extreme uncertainties that may be faced when attempting to 
model asset dynamics in the very long term, it may be appropriate to use a 
moving horizon approach. At the same time, if decisions involve commitment 
beyond the horizon, (and hence the policy space is not closed under truncation, 
in the sense of Theorem 13. ip . consideration of the longer term is necessary. 
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